Voice conversion by codebook mapping of line spectral frequencies and excitation spectrum

نویسندگان

  • Levent M. Arslan
  • David Talkin
چکیده

This paper presents a new scheme for developing a voice conversion system that modiies the utterance of a source speaker to sound like speech from a target speaker. We refer to the method as Speaker Transformation Algorithm using Segmen-tal Codebooks (STASC). Two new methods are described to perform the transformation of vocal tract and glottal excita-tion characteristics across speakers. In addition, the source speaker's general prosodic characteristics are modiied using timescale and pitch-scale modiication algorithms. Informal listening tests suggest that convincing voice conversion is achieved while maintaining high speech quality. The performance of the proposed system is also evaluated on a standard Gaussian mixture model based speaker identiication system, and the results show that the transformed speech is assigned higher likelihood by the target speaker model when compared to the source model.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A DYNAMIC PROGRAMMING APPROACH TO CONTEXT−FREE VOICE TRANSFORMATION (MonAmOR3)

In this paper, we present a dynamic programming approach to voice transformation (VT). The goal of VT is to modify the speech of a source speaker such that it is perceived as if spoken by a target speaker. The speech model used in this work is based on MELP (Mixed Excitation Linear Prediction) speech coding algorithm. The designed system obtains speaker−specific codebooks of line spectral frequ...

متن کامل

A novel voice conversion system based on codebook mapping with phoneme-tied weighting

This paper presents a novel voice conversion system based on codebook mapping. A new phoneme-tied weighting strategy is proposed to reduce the smoothing effects in weighted sum of code books, while a new prosodic conversion method by decision tree is proposed to cope with the complex prosody of Chinese. STRAIGHT algorithm is used to decompose spectrum and excitation for separate modification. L...

متن کامل

不需平行語料而基於共振峰與線頻譜頻率映對之語者特質轉換系統 (A Voice Conversion System based on Formant and LSF Mapping without Using Parallel Corpus) [In Chinese]

Voice conversion has been used in many applications. The methods based on vector quantization codebook and Gaussian mixture models need dynamic time warping on parallel sentence corpus for generating mapping functions. Recent study tries to use less training data, and even without parallel sentence corpus. This paper presents a voice conversion method without using parallel sentence corpus. It ...

متن کامل

A Hybrid GMM and Codebook Mapping Method for Spectral Conversion

This paper proposes a new mapping method combining GMM and codebook mapping methods to transform spectral envelope for voice conversion system. After analyzing overly smoothing problem of GMM mapping method in detail, we propose to convert the basic spectral envelope by GMM method and convert envelope-subtracted spectral details by GMM and phone-tied codebook mapping method. Objective evaluatio...

متن کامل

Voice conversion using General Regression Neural Network

The objective of voice conversion system is to formulate the mapping function which can transform the source speaker characteristics to that of the target speaker. In this paper, we propose the General Regression Neural Network (GRNN) based model for voice conversion. It is a single pass learning network that makes the training procedure fast and comparatively less time consuming. The proposed ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1997